Main
Fayaz Seifuddin
Background
Experienced bioinformatics leader with 15+ years in computational genomics, data science, and machine learning, driving innovation in multi-omics research
N/A
N/A
N/A
Proven track record in developing and overseeing bioinformatics strategies, optimizing data processing pipelines for genomics, transcriptomics, and epigenomics
N/A
N/A
N/A
Expertise in managing large-scale NGS datasets, ensuring data integrity, reproducibility, and actionable insights for translational and clinical applications
N/A
N/A
N/A
Strong background in leading cross-functional teams, collaborating with researchers, clinicians, and data scientists to advance bioinformatics initiatives
N/A
N/A
N/A
Published scientist with a history of driving impactful research, translating complex genomic data into meaningful discoveries
N/A
N/A
N/A
Employment
Senior Scientist in Computational Genomics
Bristol Myers Squibb
Lawrenceville, NJ
Current - 2021
Bioinformatics Staff Scientist
National Institutes of Health (NIH), National Heart, Lung, and Blood Institute (NHLBI)
Bethesda, MD
2021 - 2019
Senior Bioinformatics Research Data Analyst
Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine
Baltimore, MD
2016 - 2012
Bioinformatics Research Data Analyst
Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine
Baltimore, MD
2012 - 2008
Bioinformatics Intern
J. Craig Venter Institute (JCVI)
Rockville, MD
2008 - 2007
Education
PhD., Bioinformatics and Computational Biology
George Mason University
Fairfax, VA
2019 - 2016
- Dissertation: lncRNAKB- A Knowledgebase of Tissue-specific Functional Annotation and Trait Association of Long Noncoding RNA
M.S., Bioinformatics and Computational Biology
George Mason University
Fairfax, VA
2008 - 2006
- Thesis: Identification of Lung Adenocarcinoma Progression Biomarkers
B.S., Computer Science (minor-Mathematics)
Linfield University
McMinnville, OR
2005 - 2001
Industry & Academic Experience
Senior Scientist in Computational Genomics
Bristol Myers Squibb
Lawrenceville, NJ
Current - 2021
- Leading NGS data analysis for clinical trials, exploratory, and research studies, deriving actionable insights for precision medicine
- Developing and optimizing bioinformatics pipelines, integrating AWS, Docker, CWL, in Seven Bridges and Arvados platforms for scalable, reproducible workflows
- Directing innovative research projects on Microsatellite Instability (MSI), alternative splicing analysis, neoantigen prediction, Tumor Mutational Burden (TMB), genetic ancestry analysis, cfDNA nucleosome profiling and RNA-Seq lite in clinical applications
- Designing novel tools and workflows to support long-read sequencing (PacBio) data and advanced transcriptomics analyses
Bioinformatics Staff Scientist & PhD. Research Fellow (in Bioinformatics and Computational Biology)
National Institutes of Health (NIH), National Heart, Lung, and Blood Institute (NHLBI)
Bethesda, MD
2021 - 2016
- RNA-Seq (Transcriptomics): Led large-scale transcriptomic analyses (GTEx, 11,000+ samples), developing HPCC pipelines for expression quantification, differential analysis, pathway enrichment, eQTL and sQTL mapping
- DNA-Seq (Genomics): Designed and implemented HPCC workflows for WGS, WES, targeted DNA-seq, and CRISPR studies, including germline/somatic mutation calling, CNV detection, and mutational burden analysis
- Methyl-Seq (Epigenomics): Built WGBS pipelines for genomic and cfDNA analysis, performing CpG methylation analysis, deconvolution for tissue-of-origin identification, and processing ChIP-Seq, CUT&Tag, and CUT&RUN data
- Proteomics: Applied machine learning models (random forest and mixed-effects modeling) to predict cardiovascular risk using high-density lipoprotein proteome data
- lncRNA Analysis: Developed lncRNAKB (https://osf.io/ru4d2/), a knowledgebase integrating lncRNA tissue-specificity and disease associations, incorporating cis-eQTL, GWAS associations (UK Biobank), and co-expression modules from GTEx RNA-Seq data
Senior Bioinformatics Research Data Analyst
Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine
Baltimore, MD
2016 - 2012
- Led a systematic review and mega-analysis of microarray gene expression studies in brain tissue for mood disorders
- Developed Metamoodics, a web-based bioinformatics platform integrating genome-wide linkage, expression, and association data for Major Depression and Bipolar Disorder
- Conducted polygenic risk score (PRS) analysis, linking genetic predisposition to stressful life events and depressive symptoms in older adults
- Designed and implemented HPCC pipelines for epigenetic data analysis, including Whole Genome Bisulfite Sequencing (WGBS), to study mood disorder associations
Bioinformatics Research Data Analyst
Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine
Baltimore, MD
2012 - 2008
- Designed and executed scalable bioinformatics pipelines on High-Performance Computing Clusters (HPCCs) for GWAS analysis of mood disorders, integrating Illumina & Affymetrix array data
- Led imputation and statistical analysis of GWAS data, enhancing the resolution of genetic association studies
- Conducted a comprehensive meta-analysis of candidate gene studies in Bipolar Disorder, identifying statistically significant polymorphisms
- Applied machine learning and data mining techniques to analyze high-dimensional genomic data from psychiatric disorders
- Developed analytical workflows for genome-wide DNA methylation (DNAm) analysis
Bioinformatics Intern
J. Craig Venter Institute (JCVI)
Rockville, MD
2008 - 2007
- Developed SMURF (http://smurf.jcvi.org/index.php), a web-based tool for genomic mapping of fungal secondary metabolite clusters, enabling systematic prediction of biosynthetic gene clusters
- Designed algorithms to analyze genomic sequences and identify gene clusters responsible for biosynthesis, export, and transcriptional regulation of secondary metabolites
- Enhanced data-driven insights into fungal metabolomics, providing researchers with a streamlined approach to studying secondary metabolism
Teaching Experience
Trainer, Bioinformatics Analysis for NGS Data
The NHLBI Workshop Series
Bethesda, MD
2019
- Trained 20 staff scientists on NGS platforms and data analysis techniques
- Delivered sessions on data quality assessment (FastQC), preprocessing (Trimmomatic, Cutadapt), and alignment (BWA, Bowtie2)
- Taught variant calling (GATK, samtools) and annotation (Annovar, VEP)
- Guided gene expression analysis (Salmon, Kallisto, DESeq2, edgeR)
- Introduced data visualization tools (IGV, R/Bioconductor)
- Covered functional enrichment and pathway analysis (DAVID, GOseq)
- Emphasized best practices, troubleshooting, and validation in NGS analysis
Guest Lecturer
George Mason University, Department of Bioinformatics and Computational Biology
Fairfax, VA
2019 - 2017
- Lectured M.S. and Ph.D. students on current trends, methods, and analysis of biological sequence and NGS functional genomics data
- Engaged and instructed a class of 15 students
Tutor Coordinator & Mentor
Dallas College North Lake Campus, TRiO Upward Bound
Dallas, TX
2006 - 2005
- Hired, trained, and mentored a team of 10 tutors for the Upward Bound program, supporting first-generation and low-income high school students in college preparation
Teaching & Lab Assistant
Linfield University, Department of Computer Science
McMinnville, OR
2005 - 2004
- Designed, assisted, and graded C++ and Java programming assignments for 20 first- and second-year students in Programming and Data Structures courses
Tutor
Linfield University, TRiO Upward Bound
McMinnville, OR
2005 - 2001
- Tutored 60 students in Math and Science, focusing on Algebra I & II, Geometry, Pre-Calculus, and Calculus
Honors and Awards
NIH Pre-Doctoral Intramural Research Training Award (IRTA)
NHLBI
Bethesda, MD
2019 - 2016
Graduate Student Research Award in Bioinformatics / Biostatistics / Epidemiology / Computational & Systems Biology
NIH, Graduate Student Research Symposium
Bethesda, MD
2019
Outstanding Digital Presentation Award in Bioinformatics / Computational & Systems Biology
George Mason University, Student Research Day
Fairfax, VA
2019
Reviewers Choice Poster Abstracts, Bioinformatics & Computational Biology
The American Society of Human Genetics (ASHG)
Houston, TX
2019
Sumie Yamamoto Endowed International Students Scholarship
Linfield University
McMinnville, OR
2005 - 2001